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ABSTRACT 

The  issues  associated  with  the  maximal  exploitation  of  space-based  surveillance  resources  are 
unique  due  to  the  nature  of  the  platforms,  their  sensors  and  intermittent  communications 
links  to  the  ground  station  for  target  information  processing  and  sensor  tasking.  This  report 
apphes  the  GAMBIT  formalism  [Berry  &  Fogg  (1)]  to  the  networked  sensor  decision  problem 
of  determining  the  optimal  allocation  of  space-based  surveillance  resources  for  the  purpose  of 
detecting,  locating  and  tracking  surface  maritime  targets.  As  well  as  being  an  issue  of  interest 
in  its  own  right,  this  application  is  an  instance  of  the  more  general  surveillance  asset  allocation 
problem. 
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optimal  Search,  Location  and  Tracking  of 
Surface  Maritime  Targets  by  a  Constellation  of 
Surveillance  Satellites 


Executive  Summary 

The  key  to  integrating  surveillance  operations  and  architectures  is  the  allocation  and 
tasking  of  multiple  surveillance  assets  for  the  purpose  of  collectively  satisfying 
specified  surveillance  information  requirements  subject  to  time,  space  and  resource 
constraints.  This  is  a  classical  networked  sensor  decision  problem.  A  generic 
framework  (GAMBIT)  for  addressing  this  issue  was  developed  by  Berry  &  Fogg  [1], 
which  assumes  that  the  supporting  technologies  in  terms  of  coimnunications 
bandwidth  and  processing  power  are  in  place  and  are  represented  as  information 
constraints.  The  issues  associated  with  the  maximal  exploitation  of  space-based 
surveillance  assets  are  unique  due  to  the  nature  of  the  platforms,  their  sensors  and  the 
communications  links  to  the  groimd  station  for  the  processing  of  target  information 
and  the  tasking  of  the  sensors.  This  report  applies  the  GAMBIT  formalism  [1]  to  the 
problem  of  determining  the  optimal  allocation  of  space-based  surveillance  resources 
for  the  purpose  of  detecting,  locating  and  tracking  surface  maritime  targets. 
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1.  Introduction 


The  advancing  capabilities  of  sensors,  continued  miniaturisation  of  satellite  payloads 
and  reducing  costs  of  satellite  laimch  make  surveillance  from  space  an  increasingly 
attractive  option.  However,  the  effective  utilisation  of  the  surveillance  resource 
provided  by  space-based  platforms  requires  that  the  data  originating  from  their 
sensors  be  processed  appropriately  and  that  the  sensors  themselves,  which  provide  the 
data,  be  tasked  appropriately.  For  optimal  resource  utilisation,  the  retrospective 
exploitation  of  information  and  the  prospective  management  of  sensors  need  to  be 
integrated  into  a  single  procedure. 

This  is  a  specific  instance  of  the  more  general  integrated  surveillance  problem,  which  is 
to  determine  how  to  allocate  and  task  multiple  surveillance  assets  to  meet  specified 
surveillance  information  requirements  subject  to  time,  space  and  resource  constraints, 
while  achieving  efficiency  and  taking  into  account  uncertainty  arising  from 
environmental  effects,  xmpredictable  target  motions  and  imperfect  sensors.  Thus  many 
of  the  practical  implementation  issues  are  addressed  in  this  report  and  can  be  extended 
to  the  general  problem. 

An  optimal  formulation  of  the  problem  based  upon  Bayesian  estimation  and  an 
entropy  measure  of  surveillance  effectiveness  has  been  stated  and  a  procedure 
proposed  called  GAMBIT  (Berry  &  Fogg  [1]).  This  was  developed  for  the  purpose  of 
integrating  a  range  of  disparate  surveillance  sensor  systems  but  is  applied  here  to  the 
specific  problem  of  managing  the  sensors  on  a  constellation  of  surveillance  satellites  for 
search  operations  (i.e.,  target  detection),  target  location  estimation  and  tracking  of 
targets  in  the  maritime  domain.  Bayesian  approaches  to  tracking  are  gradually 
increasing  in  popularity  (Stone  et  al  [2]  and  Kastella  [3])  and  will  continue  to  do  so  due 
to  the  emergence  of  efficient  particle  filtering  techniques  for  Bayesian  estimation.  A 
number  of  recent  papers  have  begun  to  explore  the  use  of  entropy  for  target  tracking 
such  as  Xianxing  et  al  [4],  Fogg  [6]  and  Xianxing  et  al  [7]. 

2.  General  Statement  of  the  Problem 


The  problem  of  tracking  maritime  surface  targets  from  space  is  unique  in  the  field  of 
target  tracking  because  of  the  fact  that  sensors  do  not  scan  an  area  at  equally  spaced 
intervals  of  time.  Thus,  sensor  data  can  only  be  downloaded  for  processing  during 
specific  windows  of  opportunity  when  a  satellite  has  access  to  a  ground  station. 
Furthermore  the  satellite  sensors  can  only  be  tasked  during  such  access  periods.  Hence 
there  are  delays  in  processing  and  tasking,  which,  though  predictable,  have  to  be  taken 
into  account  in  the  tasking  computations.  This  assumes,  of  course,  a  particular  system 
architecture  in  which  processing  and  tasking  is  performed  on  the  ground.  For 
architectures  involving  on-board  processing  and  direct  communication  between 
surveillance  satellites  the  delays  could  be  reduced. 
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By  tasking  of  a  satellite  is  implied  the  choice  of  sensor  from  a  sensor  suite,  its  look-angle 
and  its  mode  of  operation  and  hence  resolution.  The  fact  that  the  area  of  interest  is 
generally  quite  large  (especially  for  search  operations),  revisit  times  potentially  quite 
long,  target  motions  inherently  unpredictable  and  the  environment  obscuring  or 
attenuating  target  signals  means  that  account  must  be  taken  of  the  uncertainty 
associated  with  information.  Hence  the  GAMBIT  formalism  [1]  would  seem  to  be 
appropriate  for  this  problem.  It  would  also  be  desirable  to  take  account  of  any  prior 
information  regarding  the  existence  and  nature  of  a  specific  target,  or  mformation 
about  targets  in  general,  and  our  ability  to  observe  them,  as  this  becomes  available,  in 
order  to  assist  in  the  execution  of  a  space-based  surveillance  operation  e.g.  a  priori 
target  normalcy  data  (such  as  shipping  lanes),  information  originating  from  an 
intelligence  source  about  the  time  of  departure  of  a  particular  target  of  interest  from  a 
port,  meteorological  information  on  cloud  cover  (in  the  case  of  EO  sensors),  and  target 
speed  and  direction  estimates  extracted  from  imagery  or  track  and  constrained  by  the 
presence  of  islands  and  land  masses. 

How  a  space-based  surveillance  operation  is  executed  also  depends  crucially  on  what 
information  is  sought  e.g.  whether  the  objective  is  to  determine  the  existence  of  all 
targets  in  a  specific  area,  to  track  all  targets  entering  a  given  area  or  crossing  a  given 
line,  or  to  track  and,  ultimately,  classify  and  identify  all  targets  conforming  to  a 
particular  set  of  characteristics  expressed  in  terms  of  location,  speed,  direction,  origin 
and  destination.  These  sets  of  characteristics  may  be  expressed  as  hypotheses  defined 
probabilistically. 

It  may  also  be  useful  to  maintain  mutually  exclusive  hypotheses  regarding  a  target  for 
the  primary  purpose  of  eliminating  it  from  consideration,  but  hedging  one's  bets  by 
maintaining  the  option  of  admitting  it  at  a  later  point  in  time.  There  is  of  course  a 
computational  cost  in  maintaining  alternative  hypotheses  but  it  provides  the  flexibility 
to  change  beliefs  in  a  rapidly  changing  situation.  By  belief  is  meant  the  choice  of  one 
hypothesis  versus  its  competitors  for  the  purpose  of  making  a  decision,  which  has  a 
physical  outcome,  such  as  tasking  a  sensor.  If  no  physical  action  is  required  then  all 
possible  hypotheses  can  be  maintained  within  available  computational  resources. 
When  a  choice  has  to  be  made  between  hypotheses  then  it  must  be  done  rationally  i.e., 
its  consequence  must  maximise  the  expected  utility  of  the  information  collected  in  the 
context  of  the  overall  surveillance  information  requirement.  Sometimes  a  compromise 
will  be  possible  and  a  decision  will  be  selected  because  it  will  assist  in  discriminating 
between  competing  hypotheses. 

2.1  Assumptions 

A  constellation  of  surveillance  satellites  is  used  to  search  for,  locate  and  track  a  ship. 
By  use  of  the  term  locate  is  meant  an  improved  estimate  of  a  previously  detected 
target's  location  and  which  is  sufficient  for  it  to  be  tracked.  The  following  assumptions 
are  made; 
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•  The  satellites'  sensors  have  a  look  angle  that  enables  them  to  sweep  a  specified 
swath  (within  their  field  of  regard) 

•  The  satellites'  sensors  have  discrete  modes  of  operation,  each  yielding  a 
particular  resolution  and  hence  swath  width,  from  which  can  be  deduced  target 
probabilities  of  detection  and  false  alarm 

•  A  detection  event  could  correspond  to  an  actual  target  or  could  be  a  false  alarm. 
An  estimate  of  target  location  and  velocity  is  associated  with  a  detection  and 
incorporates  observation  errors.  The  Bayesian  estimation  approach  can 
incorporate  the  effect  of  land  through  a  likelihood  function  by  associating  the 
existence  of  ships  only  with  areas  of  ocean  for  detection. 

•  A  target  motion  model  is  used  to  predict  future  target  locations  (actual  and 
false)  for  the  purpose  of  tasking  satellite  sensors  on  future  fly-overs.  This  takes 
account  of  the  hkely  behaviour  of  the  target  of  interest,  the  vicinity  of  ports  to 
where  it  may  be  heading,  and  the  vicinity  of  land  and  of  straits  between  islands. 
This  is  effectively  a  filter  that  eliminates  targets  that  are  unlikely  to  be  of 
interest  on  the  basis  of  their  dynamical  characteristics. 

•  For  the  future  fly-over  of  a  sateUite  and  predicted  target  locations,  its  sensor  is 
tasked  optimally  to  collect  the  required  information  subject  to  the  sensor 
capability  constraints  (incorporating  environmental  effects) 

•  In  order  for  observations  to  be  utilised,  accoimt  is  taken  of  the  need  to 
communicate  the  data  from  a  satellite  to  a  ground  station,  to  process  it  and  to 
communicate  it  to  the  appropriate  satellite  from  a  ground  station. 

Three  cases  are  proposed:  a  deterministic  model  of  the  target  motion  and  sensor 
tasking,  a  Gauss-Markov  target  motion  model  and  a  conservative  target  motion  model. 
Note  that  assumptions  regarding  the  capabilities  of  sensors  may  be  relaxed  but  in 
general  will  relate  to  the  allowable  size  and  shape  of  an  area  of  coverage  on  a  fly-pass 
and  the  corresponding  resolution.  The  filtering  of  information  in  accordance  with 
hypotheses  for  targets  of  interest  can  be  done  for  multiple  hypotheses  concurrently. 
However,  only  one  sensor  action  can  be  selected  although  this  can  be  optimised  for  just 
one  of  the  hypotheses  or  can  aim  to  compromise  by  partially  satisfying  several 
hypotheses. 


3.  General  quantitative  formulation  of  the  problem 

The  region  91  of  interest  is  discretised  into  numbered  cells,  each  cell  of  which  has 
sufficiently  small  area  that  the  probability  of  it  being  occupied  by  two  or  more 
independent  targets  is  negligible.  For  simplicity  the  cells  may  be  defined  by  the 
imposition  of  a  rectangular  grid  upon  91 .  Without  loss  of  generality,  let  there  be  N 
cells  where  the  probability  of  a  cell  being  occupied  by  a  target  at  a  time  t  is 
Pi (t),  Let  the  surveillance  satellites  have  access  to  the  region  91  at 

discrete  instants  of  time  ,  known  in  advance  and  which  will  not,  in 
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general,  be  equally  spaced,  then  we  write  /?'’  s  p.{t,,) .  These  times  are  referred  to  as 
epochs  and  let  the  area  accessed  by  a  satellite  at  the  n*  epoch  be  A,, ,  where  ci  5H  . 
Note  that  p"  may  be  interpreted  to  mean  either  the  probability  of  cell  i  containing  a 
target  at  epoch  n  (as  opposed  to  it  not  containing  a  target),  or  alternatively  the 
probability  that  a  single  target  in  the  region  9^  occupies  cell  i  (as  opposed  to  any  other 
cell).  These  represent  alternative  formulations  of  the  problem  and  which  of  them  is 
relevant  will  depend  upon  the  information  sought,  as  will  be  discussed. 

The  sensors'  performances  are  modelled  in  terms  of  their  probabilities  of  detection,  , 
and  false  alarm,  pj^, .  These  probabilities  will  vary  depending  upon  the  resolution  of 
the  sensor  mode  used  and  the  intervening  environment.  Note  that  the  false  alarm 
probability  per  cell  has  to  be  calculated  from  the  false  alarm  distribution  per  unit 
area  (i.e.,  a  Poisson  distribution  in  2-dimensions)  and  the  area  of  the  cell  i.e., 
Pf^=  Pf^M.  where  AA  is  the  area  of  a  cell.  For  simplicity  it  is  assumed  that  all  cells 
are  equal  in  area  and  that  the  false  alarm  distribution  is  uniform,  hence  p^^  is  spatially 
constant  for  a  given  choice  of  resolution.  The  probability  distribution  is  updated  based 
upon  the  observations  using  Bayes'  rule  using  a  likelihood  function  appropriate  to  the 
way  in  which  the  state  space  is  structured.  A  likelihood  fimction,  as  defined  in 
statistics,  is  the  probability  of  obtaining  a  particular  set  of  measurements  for  a  given 
state  of  the  system  (assuming  discrete  states). 

A  target  motion  update  is  computed  for  predicted  target  motiojis  between  epochs  n 
and  n+1.  Note  that  within  this  cell-based  formulation,  cells  are  assumed  to  be 
independent  and  no  unique  association  is  made  between  measurements  and  targets. 
This  is  adequate  for  determining  the  existence  of  targets  within  91  in  the  search  phase 
of  a  surveillance  operation. 

If  one  begins  to  associate  observations  imiquely  with  a  particular  target  of  interest 
either  because  it  is  known  for  a  fact  that  there  is  only  one  target  in  91  or  because  the 
means  exists  to  distinguish  that  target  from  other  targets  in  91 ,  then  one  can  specify  the 
probability  distribution  for  the  location  of  a  target  given  that  one  and  only  one  such 
target  exists  in  91 .  This  can  be  done  by  conditioning  the  distribution  upon  the 
hypothesis  that  exactly  one  target  exists.  There  will  still  be  location  errors  and 
uncertainty  about  its  location  caused  by  false  alarms  but  these  effects  will  be  captured 
by  the  probability  distribution. 

It  is  easier  to  condition  the  cell-based  distribution  upon  the  hypothesis  of  the  existence 
of  one  target  than  perform  Bayesian  updates  and  target  motion  updates  upon  the 
conditioned  probability  distribution.  It  can  be  shown  that  results  obtained  are 
equivalent  [1]. 

This  probabilistic  formulation  facilitates  a  quantitative  approach  to  area  search  for 
initial  target  detection,  detected  target  location  estimation  and  target  tracking  that  is 
more  general  than  traditional  search  and  tracking  techniques  employing  Kalman 
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filtering  and  assuming  linear  observation  and  target  motion  models  with  Gaussian 
errors  and  target  accelerations.  This  is  made  possible  by  the  fact  that  there  will 
generally  be  adequate  time  between  observations  by  satellite  sensors  to  perform 
optimal  detection  and  estimation  using  nonlinear  filtering.  For  real-time  tracking  of 
targets  with  terrestrial  radars  this  approach  is  not  yet  considered  to  be  feasible 
although  particle  filtering  techniques  for  nonlinear  filtering  are  currently  an  area  of 
active  research. 

This  formulation  also  facilitates  the  optimal  control  of  the  satellite  sensors  in  a 
predictive  sense  in  order  to  maximise  the  expected  information.  This  requires  a 
quantitative  measure  of  information  for  which  we  employ  information  entropy  as 
discussed  in  Berry  &  Fogg  [1]. 


4.  Phases  of  a  Surveillance  Operation 

It  is  assumed  that  a  surveillance  operation  is  broken  down  into  separate  search,  locate 
and  track  phases,  which  are  non-overlapping  in  time  and  have  distinct  transitions  in 
time  between  them.  The  sensor  management  strategy  is  to  be  separately  optimised 
within  each  phase  so  as  to  satisfy  its  distinct  surveillance  information  requirements.  It 
would  be  possible,  in  principle,  to  rmdertake  such  activities  concurrently  using  the 
same  set  of  surveillance  assets  with  appropriate  prioritisation  of  information,  but  this 
more  general  case  is  not  considered  here. 

For  each  of  the  phases  the  information  sought  is  different  and  hence  the  state  space  has 
to  be  structured  appropriately.  In  the  case  of  the  search  phase  it  is  simply  the  existence 
of  targets  within  91  which  is  of  interest.  The  locations  of  the  targets  are  of  interest  only 
for  the  purpose  of  maximising  search  efficiency  by  minimising  multiple  detections  of 
the  same  target.  In  the  location  estimation  phase  it  is  the  locations  of  particular  targets 
of  interest  that  are  of  primary  importance  and  hence  the  state  space  is  structured  to 
reflect  this.  In  the  tracking  phase  a  single  target  of  interest  is  tracked  on  the  basis  that 
observations  made  can  be  distinguished  from  those  of  other  targets  either  on  the  basis 
of  sufficient  spatial  separation  or  differing  target  characteristics. 

For  each  of  the  phases  of  the  surveillance  operation  the  state  spaces  and  probability 
distributions  are  defined  to  be  consistent  with  the  surveillance  information 
requirement.  Bayes'  rule  is  applied  using  an  appropriate  likelihood  fimction  to  update 
the  probability  distributions  in  the  light  of  sensor-derived  information  about  the 
existence  and  locations  of  targets.  This  is  developed  in  the  following  sections. 

For  each  phase  of  the  surveillance  operation,  we  select  sensor  actions  with  the  objective 
of  maximising  the  expected  information,  where  expectation  is  used  in  the  statistical 
sense.  In  general  this  could  involve  directing  sensors  and  their  platforms  over  a  future 
infinite  time  horizon  that,  for  practical  purposes,  would  have  to  be  made  finite  (see 


5 


DSTO-TR-1480 


Berry  [5]).  The  problem  is  simplified  in  the  case  of  a  constellation  of  satellite  sensors 
because  we  do  not  have  control  over  the  platforms  (i.e.,  fixed  constellations)  and  we 
have  the  option  of  delaying  future  tasking  of  sensors  until  the  latest  information  is 
available.  Hence  we  only  need  to  plan  for  the  next  epoch  following  an  observation  at 
the  previous  epoch. 

For  a  given  phase  the  information  requirement  is  specified  in  terms  of  the  state  space  as 
described  previously  and  information  represented  in  terms  of  a  probability  distribution 
over  the  state  space.  The  satisfaction  of  the  information  requirement,  or  equivalently 
the  completeness  of  the  information,  is  represented  quantitatively  by  the  entropy  of  the 
distribution.  The  objective  in  any  phase  of  a  surveillance  operation  is  to  reduce 
information  uncertainty  by  minimising  the  entropy  of  the  probability  distribution 
representing  it.  Since  one  cannot  predict  precisely  what  the  consequence  of  an 
observation  will  be  following  a  sensor  control  action,  the  objective  chosen  is  the 
expectation  of  the  entropy  following  the  future  observations. 


5.  Phase  I:  Search 

It  is  assumed  that  a  surveillance  operation  begins  with  a  search  activity,  whose 
objective  may  be  to  find  all,  some  or  just  one  target  within  a  given  area  based  upon 
some  prior  iirformation  as  to  the  targets'  locations  or  distribution.  It  could,  in  fact,  have 
the  objective  of  confirming  that  there  are  no  targets  in  a  given  area.  In  general  we  will 
consider  that  the  search  phase  of  a  surveillance  operation  has  as  its  objective  the 
determination  of  the  existence  or  non-existence  of  targets  within  a  region. 

The  most  general  formulation  of  the  problem  is  obtained  by  defining  a  joint  probability 
distribution  for  the  numbers  of  targets  in  the  region  and  their  locations  (Berry  &  Fogg 
[1]).  This  may  be  initialised  using  an  a  priori  distribution,  updated  using  Bayes'  rule  in 
response  to  observations  at  specific  instants  of  time  and  evolved  over  time  to  reflect 
target  motion.  This  is  the  most  general  formulation  for  the  representation  of  targets  and 
their  locations  in  the  case  of  targets  not  being  distinguished,  that  is  a  measurement  is 
not  associated  with  any  of  a  set  of  previous  measurements  for  a  target  as  is  the  case  in 
target  tracking.  It  is  the  most  general  formulation  in  the  sense  that  it  allows  for 
correlatiorrs  between  targets  (e.g.  targets  comprising  a  naval  fleet)  and  the  likelihood  of 
any  hypothesis  expressed  in  terms  of  the  existence,  numbers  and  locations  of  targets  in 
the  region  or  a  sub-region  can  be  deduced  from  it.  However,  because  the  state  space 
grows  combinatorially  with  target  numbers  it  is  computationally  inefficient  to  perform 
Bayesian  updates  and  predict  expected  entropies  arising  from  proposed  sensor  actions. 
This  most  general  formulation  of  the  search  problem  is  referred  to  as  the  global 
formulation. 


The  reason  why  it  is  necessary  to  define  a  probability  distribution  over  the  region  (as 
opposed  to  simply  associating  probabilities  with  detections)  is  because  during  a  search 
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activity  one  is  concerned  with  the  existence  of  targets  rather  than  their  numbers  or 
locations.  It  is  therefore  necessary  to  monitor  those  parts  of  a  region  that  have  been 
searched  and  when  in  relation  to  likely  target  speeds  due  to  the  possibility  of  targets 
entering  an  area  previously  searched. 

A  computationally  more  efficient  formulation  of  the  search  problem  [1]  arises  from  the 
assumption  that  targets  are  independent  of  each  other  and  their  distribution  conforms 
to  a  2-D  Poissonian  spatial  distribution  that  becomes  a  2-D  Bernoulli  distribution  when 
discretised.  This  is  implemented  by  discretising  the  region  into  a  grid,  within  each  cell 
of  which  there  is  defined  a  probability  that  it  contains  a  target.  The  cell  size  is  taken  to 
be  sufficiently  small  that  the  probability  of  a  cell  containing  more  than  one  target  is 
negligible.  All  cells  are  considered  to  be  independent,  and  values  for  their  probabilities 
are  updated  independently.  Hence  each  cell  is  considered  to  be  a  state  space  in  its  own 
right  with  the  probabilities  of  a  target  existing  or  not  existing  within  it  summing  to 
unity.  This  is  referred  to  as  the  localised  formulation  of  the  problem.  There  would  be  no 
attempt  to  track  a  target  during  the  search  phase  of  an  operation  since  the  objective  is 
to  associate  a  general  area  within  the  region  with  the  existence  of  a  target  as  quickly  as 
possible.  Location  estimates  will  tend  to  degrade  over  time  as  a  consequence  due  to 
target  motion.  Hypotheses  can  be  derived  directly  from  the  localised  formulation.  It 
can  be  shown  that  rather  than  update  probabilities  conditioned  upon  hypotheses, 
updating  probabilities  in  the  localised  formulation  and  then  conditioning  is  equivalent 
and  easier  to  implement  [1]. 

The  probabilities  for  cells  within  the  swath  A„  are  subjected  to  Bayesian  updates  based 
upon  whether  or  not  a  detection  occurs  in  each  of  them.  The  probabilities 
corresponding  to  cells  outside  of  are  not  changed  since  no  new  information  is 
received  (i.e.,  no  detections  or  non-detections).  It  can  be  demonstrated  [1]  that  the 
entropy  for  the  global  distribution  of  targets  is  equal  to  the  sum  of  the  entropies  for  the 
individual  cells  in  the  localised  formulation. 


A  sensor  action  is  selected  which  minimises  the  expected  entropy  of  the  information 
sought,  for  this  will,  on  average,  most  closely  satisfy  the  information  requirement.  The 
expectation  is  performed  on  the  future  information  based  upon  the  current 
information.  The  sensor  control  action  is  assumed  to  be  the  sensor  look  angle  a ,  the 
value  of  which  determines  uniquely  the  area  of  coverage  Aipc) . 


5.1  Bayesian  update 


If  the  a  priori  probability  for  a  target  in  cell  /  at  epoch  n+1  is  p'fo' ,  and  a  detection 
occurs  in  that  cell  as  a  consequence  of  it  being  observed,  then  application  of  Bayes'  rule 
gives  the  a  posteriori  probability 


Pd  Pi 
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If  a  detection  does  not  occur  as  a  consequence  of  the  cell  being  observed  then 


/^r'(0)=- 


(i-p.)A' 


H+l 


Cells  that  are  not  observed  are  not  updated,  as  cells  are  assumed  independent  in  the 
localised  formulation. 


5.2  Target  motion  prediction 

This  gives  the  a  pnori  distribution  at  time  C+i  {(n+lf  epoch)  given  the  a  posteriori 
distribution  p"  at  time  due  to  non-deterministic  target  motion. 

A  Markovian  target  motion  model  is  defined  as  follows.  Assume  that  the  probability 
that  a  target  is  in  cell;  at  time  given  that  it  was  in  cell  i  at  time  is  Q(i,j,t„^^  -t„). 

Then 

as  the  target  that  is  in  cell  i  at  time  has  be  somewhere  at  time  (Conservation  of 
Probability).  It  will  eventually  be  necessary  to  take  accouunt  of  boundary  conditions  due 
to  targets  drifting  in  and  out  of  the  region  91 .  Then  the  probability  of  a  target  being  in 
cell;  at  time  due  to  possibly  moving  there  from  all  other  cells  since  time  t„  is 

N 

P J  (^n+l  )  ~  Q(}>  j ■>  ^n+1  ~  ')Pi  )  ‘ 

/=1 

This  target  motion  model  is  Markovian  because  the  motion  of  the  target  depends 
purely  upon  where  it  was  located  at  the  last  epoch  and  not  on  where  it  was  located 
prior  to  that  time.  Because  the  times  between  epochs  are  unequal  in  length,  the 
probability  transition  matrix  Q  is  made  explicitly  dependent  upon  the  time  interval 

^n+l  * 

Often  a  Gaussian  distribution  is  chosen  to  represent  the  non-deterministic  motion  of  a 
target  between  observations,  in  which  case  this  would  be  referred  to  as  a  Gauss- 
Markov  target  motion  model  and  Q  would  be  defined  appropriately.  It  will  be  seen 
when  a  target  is  being  tracked  its  velocity  as  well  as  location  need  to  be  estimated  and 
the  state  of  a  target  at  any  time  no  longer  depends  only  upon  location.  However,  its 
target  motion  model  may  stiU  be  considered  Markovian  by  extending  its  state  space  as 
will  be  seen.  Target  motion  models  are  detailed  in  Section  9.1. 


5.3  Sensor  Control  Decisions 

It  is  necessary  to  determine  which  area  cz  91  a  satellite  should  access  at  the 
(«  +  l)*^  epoch  to  maximise  the  information  obtained.  In  order  to  specify  this  as  a 
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quantitative  optimisation  problem  we  use  information  entropy  as  a  measure  of  the 
quality  of  surveillance  information.  Entropy  is  defined  by  the  probabilities  over  a  state 
space.  If  we  take  the  joint  probability  distribution  for  the  numbers  and  locations  of  aU 
possible  targets  in  the  localised  formulation  and  take  its  entropy,  then  it  can  be  shown 
[1]  that  the  entropy  is  simply  the  sum  of  the  entropies  for  the  individual  cells. 
Consequently  one  can  consider  the  consequences  of  a  detection  or  non-detection  for 
each  cell  separately  for  each  cell  inspected.  One  can  predict  the  contribution  to  the  total 
entropy  by  the  i*  cell  immediately  before  any  observations  as 

Ar'  =-pr’iogA”^'  -(i-pr')iog(i-A:-') 

using  the  a  priori  probabilities  obtained  from  the  target  motion  update.  If  one 
were  to  assume  a  perfect  sensor  then  the  consequence  of  inspecting  the  i*  cell,  whether 
a  detection  occurred  or  not,  would  be  to  reduce  its  contribution  to  the  total  entropy  to 
zero.  In  this  idealised  case  the  choice  of  area  would  be  that  for  which  the  total 
contribution  of  all  its  cells  before  observation  was  a  maximum,  which  would  be 
obtained  simply  by  choosing  the  swath  with  the  most  cells.  This  would  require  the 
assumption  that  and  Pj^  =  0 ,  that  is  a  perfect  sensor  with  no  obscuration.  For 

those  cells  not  inspected  there  is  no  change  to  their  contribution  to  the  entropy  due  to 
the  independence  assumption. 

If  we  are  to  accommodate  the  issue  of  sensor  resolution  through  values  of  p^i  <  I  and 
Pj^  >  0  then  we  need  to  look  at  the  expected  change  in  entropy  following  an 
observation.  Independence  of  cells  enables  this  to  be  computed  easily  per  cell. 

Let  (0)  be  the  probability  at  the  (n+2)*  epoch  after  a  hypothetical  non-detection  in 
the  f'  cell  and  p"^'  (1)  be  the  probability  after  a  hypothetical  detection  there.  Then  the 
entropy  in  each  case  is 

hr'  (0) = -pr'  (0)  logpr'  (o)  -  o  -  pf  (o))  logo  -  pr  m) 

and 

hr'  (1) = -pr'  (1)  log  pr'  (o  -  (i  -  pr'  (o)  logo  -  pr'  o)) 

and  hence  the  expected  entropy  is 

E(hr )  =  ?r{0}hr'  (0)  +  Pr  {I}//""'  (1) 
where  the  probability  of  a  detection  is 

Pr  {1}  =  Pr  {detection}  =  p^p”^'  +  p^^  (1  -  p^ ) 
and  the  probability  of  a  non-detection  is 

Pr{0}  =  Pr  (no  detection)  =  1  -  Pr  (detection)  =  1  -  p^p"*'  -  Pfa  (1  “  A”  ' ) 

The  choice  of  control  action  a  is  that  which  maximises  the  expected  change  in  entropy 
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max,  2W*'-£(r')} 

i^A(a) 

where  h."'^  is  the  a  priori  entropy  of  the  f*  cell  at  the  n+1*  epoch  following  target 
motion  update. 

This  sum  of  cell  entropies  will  never  be  reduced  to  zero  in  practice  (corresponding  to 
perfect  information)  because  of  targets  moving  between  observations,  inadequate 
coverage  and  imperfect  sensors,  so  the  Law  of  Diminishing  Returns  applies.  At  some 
point  in  time  therefore  additional  surveillance  effort  will  result  in  no  net  improvement 
in  the  quality  of  information.  This  point  may  never  be  reached  if  the  objective  of  a 
surveillance  operation  is  to  search  for  a  target  (or  targets)  of  a  particular  type  and  then 
locate  them  with  sufficient  accuracy  to  track  them.  Similarly  if  Phase  I  is  limited  in 
time.  Assuming  that  Phase  I  is  not  unlimited  in  time  then  a  condition  will  be  tested  to 
determine  when  Phase  II  begins. 

Of  particular  interest  may  be  the  total  number  of  targets  in  91 ,  irrespective  of  their 
locations,  in  which  case  this  parameter  should  be  estimated.  The  expected,  or  average, 
number  of  targets  in  91  at  the  n*  epoch  is  simply 

N 

Zp: 

1=1 

The  distribution  for  the  number  of  targets  in  91  is 

Pr{A:  targets  in  91}  =  ^  ^  ...  ^  P^Ph-’-PhYl^^'' Pj^ 

>1='  >2=yi+'  y*=A-i+> 

j*j\ 

etc. 

N  N  N  N 

=ilpj.  z  Ph-- X 

71=1  A=A-[+l 

J*J\ 

etc. 

from  which  the  entropy  can  be  computed,  if  required. 


6.  Phase  II:  Locate 

The  search  phase  of  a  surveillance  operation  will  continue  until  there  is  sufficient 
confidence  that  the  number  of  targets  in  the  region  is  known  by  testing  the  entropy  for 
the  hypothesis  for  target  numbers.  This  will  never  reduce  to  zero  due  to  false  alarms, 
non-detections  and  target  motion  but  at  some  point  a  decision  has  to  be  made  to 
advance  to  the  locate  phase  of  a  surveillance  operation.  This  decision  has  to  be  made 
with  reference  to  the  overriding  mission  objective  and  the  desire  to  minimise  the 
entropy  of  the  surveillance  information  requirement  associated  with  it. 
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The  objective  of  the  locate  phase  is  to  determine  the  current  locations  of  targets  whose 
existence  has  previously  been  determined  through  execution  of  the  search  phase.  These 
targets,  once  located,  may  be  tracked  in  the  next  phase,  enabling  other  sensors  to  be 
cued  to  identify  or  classify  them  as  required.  The  probability  distribution  for  the  locate 
phase  is  the  distribution  for  the  target  locations  conditioned  upon  the  hypothesis  for 
their  numbers  in  the  region.  When  the  entropy  for  this  distribution  is  sufficiently  small 
due  to  the  availability  of  improved  estimates  for  target  locations,  their  locations  may  be 
replaced  by  separate  spatial  distributions  for  each  target  if  they  are  distinguishable,  or 
more  simply  by  point  estimates  with  Gaussian  errors  as  is  traditionally  done  in 
tracking. 

We  define  the  probability  distribution  for  the  locations  of  k  targets  in  91  at  the  n* 
epoch  thus: 

P"  Ui  >  7 2  V  • ,  A )  =  Pr  {targets  in  cells  y, ,  72 » •  •  ■ » Jk  I  ^  targets  in  91} 

_  Pr{targetsin  cells  n  A:  targets  in  91} 

Pr{A:  targets  in  91} 

where  the  numerator  is  obtained  from  the  localised  (cell-based)  formulation  using 

Pr{3A:  targets  in  cells  }  =  p"  p}'  Pj  ) 

2=1 

J*J\ 

etc 

and  the  denominator  has  previously  been  given  in  Section  5.3.  The  sum  of  the 
probabilities  over  all  possible  configurations  of  k  targets  is  unity,  i.e., 

Z  Z  -  Z^"0’ . .2.)=i- 

Ji  •  •  Ji  J]  Jl  -Jl  "P*  Jk  -'Jk  -1 

Note  that  the  distribution  for  the  locations  of  k  targets  has  states,  which  can 
become  large  for  more  than  a  few  targets. 

6.1  Bayesian  update  for  location  estimation  of  multiple  targets 

Rather  than  maintain  a  distribution  for  P" update  it  directly  using 
Bayes'  rule  for  the  assumed  number  k  of  targets  (which  requires  specification  of  the 
appropriate  likelihood  function),  it  is  simpler  to  continue  updating  the  individual  cell 
probabilities  separately  (assuming  cell  independence  as  in  the  localised  formulation) 
and  then  condition  on  the  number  of  targets  as  shown  in  the  previous  paragraph.  This 
is  possible  because  the  ordering  of  hypothesis  conditioning  and  Bayesian  updating  is 
immaterial.  The  advantage  of  this  is  that  because  the  localised  formulation  has  a 
smaller  number  of  states,  fewer  Bayesian  updates  have  to  be  performed  (N  as  opposed 
to  ^C,). 
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6.2  Target  Motion  Update 

Again,  it  is  easier  to  perform  the  target  motion  update  in  the  localised  formulation,  as 
in  Section  5.2,  and  then  condition  the  probabilities  on  the  hypothesis  for  the  number  of 
targets  than  to  apply  the  target  motion  update  directly  to  P"  (7, ,  y 2 ,  •  •  • ,  A  )  • 

6.3  Sensor  Control  Decisions 

The  entropy  corresponding  to  the  information  regarding  the  locations  of  an  assumed 
number  k  of  targets  at  the  n*'’  epoch  is 

h-=-J^P\h,h . A)logP"0„A,..-.A) 

h-Jk 

Computation  of  this  expression  results  in  a  measure  of  the  quality  of  the  information 
obtained  about  the  k  targets.  The  entropy  for  the  hypothesis  that  there  exist  exactly  k 
targets  should  also  be  monitored  to  ensure  that  the  hypothesis  is  still  relevant.  In 
practice  multiple  hypotheses  can  be  maintained  and  monitored,  but  only  one  physical 
sensor  action  can  be  selected.  The  choice  of  sensor  action  should  be  to  prove  or 
disprove  a  hypothesis  (e.g.,  that  there  are  k  targets,  as  opposed  to  any  other  number) 
and  to  maximise  the  information  associated  with  a  hypothesis  beUeved  to  be  true  (e.g., 
where  the  assumed  k  targets  are  located).  There  will  almost  always  be  some 
uncertainty  associated  with  a  hypothesis  unless,  or  until,  its  entropy  is  exactly  zero. 

Entropy  may  provide  an  objective  measure  of  information  but  in  practice  the  large 
number  of  potential  states  makes  it  difficult  to  compute.  However  most  states  are  likely 
to  have  very  small  probabilities  associated  with  them  and  so  need  not  be  considered  in 
the  summation.  The  primary  significance  of  this  formulation  is  that  it  provides  an 
objective,  quantitative  approach  from  which  rational  approximations  may  be  derived 
for  implementation. 

The  choice  of  sensor  control  parameter  should  be  based  upon  the  expectation  of  the 
entropy  following  the  observation,  as  before.  Whereas  for  each  cell  observed  only  two 
outcomes  are  possible  for  the  case  of  independent  cells  (a  detection  or  non-detection), 
for  the  present  case  in  which  the  cells  are  not  independent  since  they  are  constrained 
by  the  condition  that  there  are  a  fixed  number  of  targets,  aU  possible  configurations  of 
detections  and  non-detections  resulting  from  a  choice  of  sensor  control  action  (swath 
and  resolution)  have  to  be  considered  in  performing  the  expectation  of  the  entropy. 
Then  the  sensor  control  action  that  yields  the  best  improvement  in  expected  entropy 
has  to  be  fotmd  and  selected.  If  the  number  of  targets  were  small  and  probabilities  of 
false  alarm  small  then  it  would  certainly  be  practical  to  compute.  This  is  demonstrated 
for  the  case  of  a  single  target  {k=l)  in  the  following  section. 

The  optimal  swath  selection  procedure  can  be  found  in  Appendix  C.3. 
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6.4  Single  target  case 

In  the  special  case  of  a  single  target,  a  likelihood  function  can  be  written  for  updating 
the  probability  distribution.  We  re-use  the  notation  for  the  independent-cell  case  but 
interpret  the  probability  distribution  p"  to  mean  the  probability  that  the  single  target 
occupies  the  i*  cell  at  the  epoch  so  that 

N 

Ea'=i. 

/=1 

Suppose  that  at  a  particular  epoch  there  are  M  cells  under  the  swath  A  and  a  set  of  k 
detections  occurring  in  cells  one  or  none  of  which  may  correspond  to 

the  true  target,  the  remainder  being  false  alarms.  Then,  since  we  are  assuming  that  all 
observations  are  associated  with  one  and  the  same  target,  the  probabilities  over  the 
cells  comprising  91  have  to  be  updated  using  Bayes'  rule  everywhere,  whether  a  cell  is 
under  the  swath  or  not.  That  is,  because  observing  a  target  known  to  exist  somewhere 
reduces  the  probability  of  it  being  elsewhere. 

Bayes'  rule  updates  the  probability  distribution  for  the  target  based  on  a  set  of 
observations  in  cells  {jx,j2i---ijk}  as  follows: 

LUxJ2,---Jk\^)Pi 

Y,L{jxJ2,-,jk\l)Pi 

lem 

where  p,  is  the  a  priori  distribution  following  the  target  motion  update.  In  practice,  of 

course,  the  probability  of  false  alarm  will  be  relatively  low  and  the  number  of  false 
alarms  per  sweep  or  scan  will  most  likely  be  small  in  number. 

The  likelihood  function  used  in  Bayes'  rule  has  different  expressions  depending  upon 
whether  cell  i  is  in  A,  or  not  in  A  {L,i, ),  and  on  whether,  if  it  is  in  A,  it  is  one  of  the 
Ui  ■>  j  2^- ■■■>jk)  i^ii)  ( ^/  )•  Note  that  all  of  {y, ,  , . . . ,  )  must  be  in  A  (because  a 

sensor  can  only  obtain  detections  where  it  observes).  The  Bayesian  update  procedure 
can  be  found  in  Appendix  C.5. 

Case  I  corresponds  to  the  situation  m  which  the  target  is  contained  in  A  but  does  not 
belong  to  the  set  of  detections.  So  the  likelihood  function  is  the  probability  that  the 
target  is  not  detected  but  that  there  are  k  false  alarms  in  A  (and  hence  M-k-l  non-false 
alarms). 

Casel:  ieA  but  i  ^  {j\,j2’--->Jk}  k 

L,ij\,j2,--;Jk  \  ^{j\,J2,--;Jk})  =  (^- Pd)Pfa(^- 

Case  II  corresponds  to  the  situation  in  which  the  target  is  contained  in  A  but  does 
belong  to  the  set  of  detections.  So  the  likelihood  function  is  the  probability  that  the 
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target  is  detected  and  that  there  are  k-1  false  alarms  in  A  (and  hence  M-k  non-false 
alarms). 

Case  II:  2  6/1  and  i  g  ior  k  = 

^II  ^j\  ^  Jli'  •  ">  j  k  I  i  G  /I  ^  ^  ^  {y'l )  72  ’  •  •  •  ’  ./i  })  “  PdP  fa  P  fa^ 

Case  III  corresponds  to  the  situation  in  which  the  target  is  not  in  A,  so  all  detections 
must  be  false  alarms  and  the  question  of  whether  the  actual  target  is  detected  or  not 
does  not  arise.  Hence  the  likelihood  function  is  the  probabiKty  of  k  false  alarms  and 
M-k  non-false  alarms. 

Case  III:  i^A  for  k  =  0,...,M 

(7i  =  Pfa  (^-Pfa)^^'‘ 


Note  that  all  of  the  above  assumes  that  the  probabilities  of  detection  and  false 
alarm  (per  cell)  are  spatially  uniform  in  A  and  are  known  functions  of  the  sensor 

resolution,  which,  together  with  the  look  angle,  determine  the  size  of  A.  The 
probabilities  can  in  principle  be  made  spatially  and  temporally  non-uniform  so  to 
represent  environmental  effects. 

The  target  motion  update  is  performed  as  before  to  obtain  the  a  priori  distribution  for 
the  Bayesian  update 

N 

Pj  (^,,+1 ) = X  ’  j'  ) 

/=! 

and  the  entropy  evaluated  over  all  states  is 

The  following  is  an  implementation  of  the  expression  for  the  expected  entropy  for  the 
single  target  case  with  non-zero  false  alarm  rate. 

It  is  proven  in  Appendix  A  that  the  expectation  of  the  entropy  over  all  possible  sets  of 
observations  /  =  {7, ,  72  ’  •  •  • » A  )  ^  ^  epoch,  where  A  is  the  set  of  all  possible 

sets  of  observations,  is 

A =  /2"  -  X  pT'  X  i  0  log  I  '■) + X  . 

i  2eA  /eA 

where  L(l  \  i)  is  the  sensor's  likelihood  function  defined  as  the  probability  of  obtaining 
measurement  set  /  =  given  that  the  target  is  in  cell  i ,  is  the  a  priori 

probability  that  the  target  is  in  cell  2  (following  target  motion  update),  and  (/)  is 
the  probability  of  obtaining  the  measurement  set  / .  Hence 
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ie'n 

The  likelihood  ftmction  depends  upon  the  area  A  swept  by  the  sensor  and  hence  so 
does  the  expected  entropy.  It  also  depends  upon  the  resolution  of  the  sensor  because 
this  eiffects  both  the  probability  of  detection  and  probability  of  false  alarm  The 
intention  should  be  to  choose  the  area  of  coverage  and  sensor  resolution  so  as  to 
minimise  the  expected  entropy.  This  will  involve  a  trade-off  between  area  of  coverage 
and  sensor  resolution.  There  is  complication  caused  by  the  fact  that,  although  we  are 
assuming  a  single  target,  there  are  potentially  a  large  number  of  detections.  However 
the  false  alarm  rate  will  generally  be  small,  which  will  tend  to  limit  the  number  of 
detections  occurring  in  practice.  It  is  possible  to  interpret  the  meaning  of  the  terms  in 
equation  6.1  and  this  is  done  in  Appendix  B. 

The  difficulty  in  evaluating  the  expression  for  the  expected  entropy  lies  in  the  fact  that 
there  are  summations  over  all  possible  numbers  and  configurations  of  observations 
due  to  false  alarms,  although  most  of  these  wiB  have  an  extremely  small  probability. 
However,  to  totally  ignore  the  effect  of  false  alarms  would  be  to  ignore  the  ambiguities 
that  could  occur  when  the  sensor  resolution  is  low.  That  is,  not  only  would  the 
probability  of  detection  be  reduced  but  also  the  possibility  of  obtaining  false  alarms 
could  increase. 

To  make  it  clear  that  the  expected  entropy  is  the  expectation  over  the  numbers  of 
detections  and  their  configuration,  we  rewrite  equation  6.1  by  replacing  I 
by  {y,, 72,. Then 

M  M 

=  I01ogl(7,...7i \i)+Z 

/e'J?  *=0  k-Q 

A/—!  A/  M 

Z^z//iog^/// 

ieA  i=0/e|y,...y*)  ieA  k=1  ieA  k=0 

M 

+E  Zz'U-/.)iogZ’(y,...y.) 

*=0(y,...A) 

M-\ 

= *  -  S  A  E  Eo  -  )  A  (>  -  p/.  )"■*■'  >og{(i  -  A )  A  (1  -  p/.  >*'■*■’ ) 

le/f  *=0iV{y,...yj) 

M 

“  Z  A  Z  Z  ^<z (1  -  (1  -  )"■* } 

UA  i=i /6{y|...y,) 

M 

-  Z  ft  £  E  A  (1  -  Pf. )“  *  '“gfftJ.  0  -  Pf.  )"■* } 

itA  *-0!y|...yj) 

M 

+  Z  Z^(A-"A)log^(7....7*) 

*=o  ivv-y*) 

where 
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=  \i)Pi 

/e9! 

=  /’i  +  ^  -^/Z  Pi  S  ^lll  Pi 

ieA  ieA  ieA 

ie{J,.-jt)  i^iJi-Jk) 

has  separate  expressions  for  the  cases  A:  =  0,0  <  k  <  M  and  k  —  M.  For  k  —  0: 
P(no detections)  =  ^ -i-/ /»,  +  X L,„p^  =  Pd)0^- P P a  P ja)'^ P a) 

ieA  ieA 

where  =  ^ Pi  •  For  0<k  <M  : 

ieA 

/=1  /=1 

and  for  k=M : 

)  =  ^^iiPi  '^^mPi  ~  PdPfa  Pa  +  Pjd  Pa) 

ieA  ieA 

The  expected  entropy  expression  may  further  be  simplified  thus: 

A/-1 

ieA  A=0 
M 

-Zp^z  0  -  )"-*  iog{py-'  (1  -  pp  )"■* } 

ieA  A=1 
M 

(•  -  i'/.  )"'*  > 

ieA  A:=0 
M 

k=0{j,...j\) 

M-\ 

‘h-p.f^^-'CAi-p,  )p‘.  (1  -  P,.  )"'*''  log{(l  -  P,  )p\.  0  -  /)/, )"  ' } 

k=0 

M 

-PaZ  ""^'^k-xPAPp'  i^-Pfa  l0g{P<zP?'  0  -  Pfa 
k=\ 

M 

k=0 

M 

+Z  Z^o\--Jk)^ogPij\...j\) 

k=0 

or. 
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h’  =  h-c,  (p^ ,  Pj„,M)Pa  -  (Pd  =  Pfa  M)Pa  -  c,ii  {Pd^PfaM)^  -  Pa  ) 

M 

It  is  apparent  that  the  expression  for  the  expected  entropy  is  a  series  in  ascending 
powers  of  pj-^  and  that  we  can  decide  to  truncate  the  series  at  any  term  so  as  to 

simplify  computation.  For  instance  we  could  decide  to  retain  terms  in  and  but 

to  discard  terms  in  pj-^  and  above  on  the  grounds  that  they  are  negligibly  small.  Note 

that  care  is  required  here  because  although  the  probability  of  a  particular  pair  of  false 
alarms  may  be  small,  the  probability  of  any  hvo  false  alarms  occurring  somewhere  may 
not  be  insignificant.  In  other  words  we  have  to  take  accormt  of  not  just  the  magnitude 

ofp^,,butof  "Qpi. 


The  procedure  for  computing  the  expected  entropy  change  is  in  Appendix  C.4. 


7.  Phase  III:  Track 


The  localised  formulation  of  Section  5  has  been  used  for  tracking  [3]  but  is,  the  authors 
believe,  fundamentally  flawed  as  weU  as  grossly  inefficient.  This  is  because  detecting  a 
target  with  certainty  at  a  location  does  not  cause  the  probability  of  a  target  existing  in 
its  neighbourhood  to  collapse  to  zero,  even  if  it  was  known  that  there  was  only  a  single 
target  in  that  neighbourhood.  This  is  a  consequence  of  the  fact  that  information  is 
stored  from  a  spatially-oriented  perspective  and  not  from  a  target-oriented  perspective. 
In  particular,  if  a  measurement  is  known  to  be  associated  with  a  particular  target  but 
that  fact  is  not  recorded,  or  not  capable  of  being  recorded  within  one's  information 
model,  then  information  has  effectively  been  discarded. 

The  way  around  this  is  to  associate  a  measurement  with  a  particular  target  and 
maintain  separate  probability  distributions  for  the  locations  of  each  target.  This  process 
of  associating  measurements  with  targets  is  inherent  to  most  trackers  such  as  the  PDA 
tracker  [4]  and  there  is  little  point  in  attempting  to  reproduce  such  a  tracker  here  except 
to  say  that  they  are  probabilistically-based  and  therefore  consistent  with  the  present 
GAMBIT  formalism.  Suffice  it  to  say  that  we  will  assume  for  present  purposes  that  the 
association  between  targets  and  measurements  has  been  perfectly  achieved.  Note  that 
if  there  is  only  a  single  target  within  the  region  of  interest,  or  that  there  are  multiple 
targets  but  measurements  can  be  reliably  associated  with  particular  targets,  then  the 
issue  does  not  arise. 
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7.1  Bayesian  update  for  track  estimate  of  a  single  target 

Location  estimation  using  Bayes'  rule  does  not  generate  point  estimates  from  which 
tracks  can  be  composed;  instead  it  produces  a  probability  distribution  for  the  location 
of  the  target  over  the  region  91 .  To  extract  point  estimates  from  the  probability 
distribution  for  track  formation  purposes  would  be  to  discard  information  and 
therefore  lose  the  benefits  of  a  Bayesian  approach.  We  choose,  therefore,  to  extend  the 
Bayesian  location  estimation  formulation  to  track  formation  so  as  to  continue  working 
with  probability  distributions  and  avoid  having  to  extract  point  estimates.  When 
probabilities  for  track  states  are  found  to  be  negligibly  small  then  they  can  be 
discarded. 

For  present  purposes  a  track  is  defined  as  a  pair  of  locations  representing  both  where 
the  target  is  at  the  latest  epoch  and  where  it  was  at  the  previous  epoch.  We  define  a 
probability  distribution  over  the  state  space  of  all  such  location  pairs  as  follows 

p'!.  =  Pr  {target  moved  from  cell  i  at  epoch  n  - 1  to  cell  j  at  epoch  n] . 

Applying  Bayes'  rule  to  obtain  the  probability  at  epoch  n+2  from  the  probability  at 
epoch  n  based  upon  a  set  of  observations  =  {_/, , ,  •  •  • » 7*  } 


«+i//'<+K  Pr  (target  moves  from  i  to  j  n  observations  r*  } 
Puj  (-  )  -  Pr{r'"'} 

Pr{r"'} 


The  likelihood  function  for  the  set  of  observations  at  epoch  n+1  does  not,  in  fact, 
depend  upon  the  target  location  at  epoch  n,  only  on  where  the  target  is  at  the  instant  of 
observation  at  epoch  n+1.  Hence  we  can  write 


k;’ (/"')= 


L{r'  I  7) Ay 
Pr{/"^'} 


i:Z^(r"'i7)K;' 

ie'Jl  ye'Ji 


where  p'P'  is  the  a  priori  probability  following  the  target  motion  update,  and  the 

likelihood  function  Z,(r‘ '  I  J)  is  the  same  as  the  one  previously  defined  for  estimating 
target  location  in  the  case  of  a  single  target.  The  Bayesian  procedure  is  presented  in 
Appendix  C.5. 


7.2  Target  Motion  Prediction 

A  Markovian  target  motion  model  that  exploits  knowledge  about  estimates  of  the 
target  velocity  can  be  used  to  obtain  the  a  priori  distribution  at  epoch  n+1  from  the  a 
posteriori  distribution  at  epoch  n.  This  is  simply 
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n 


where  the  transition  probabiKty  matrix  Q  now  depends  not  only  upon  where  the  target 
was  at  the  previous  epoch  (ceU  i  at  time  t„)  but  how  it  got  there  (from  cell  i'  at  time 
and  hence  implicitly  upon  its  velocity.  The  target  motion  model  is  therefore  more 
general  than  that  used  for  the  search  and  location  estimation  phases  since  it  takes  into 
account  not  just  the  location  of  targets  but  estimates  of  their  speed  and  direction.  Note 
that  Q  is  also  implicitly  dependent  upon  the  time  intervals  and  . 


To  obtain  the  predicted  distribution  for  location  only  we  sum  over  all  possible  start 
locations  i: 


pr'=SK;'=Zeo' I 


7.3  Sensor  Control  Decisions 

The  expected  entropy  depends  purely  upon  the  sensor,  environment  and  target 
location  but  not  on  the  target  velocity.  Hence  the  same  expression  may  be  used  as  for 
the  single  target  location  estimation  phase  except  that  better  estimates  of  its  location 
are  used  by  virtue  of  the  fact  that  the  target  motion  model  is  implicitly  adapted  to  the 
target's  previous  behaviour.  Therefore  one  would  expect  the  tracking  algorithm  to 
perform  better  than  the  single  target  location  estimation  algorithm. 


8.  Transitioning  Between  Phases 

Since  the  optimal  surveillance  strategies  are  different  for  each  of  the  phases  and  we 
assume  that  the  phases  are  non-overlapping,  a  means  for  determining  when  to 
transition  between  them  is  required.  A  phase  transition  will  occur  when  that  phase  has 
been  "completed",  i.e.,  when  the  required  information  for  that  phase  has  been 
obtained.  So,  for  example,  phase  I  ends  when  all  targets  have  been  detected  and  phase 
II  ends  when  all  detected  targets  have  been  located  sufficiently  accurately  to  be 
tracked.  In  reality,  however,  the  surveillance  information  requirement  wiU  never  be 
perfectly  satisfied  because  the  targets  are  moving  and  the  information  is  continually 
degrading.  A  point  will  be  reached  at  which  additional  effort  makes  little  or  no 
difference  to  the  quality  of  information  obtained.  By  quantifying  the  information 
obtained  during  each  phase  and  comparing  it  against  the  information  sought  we  have 
the  means  of  monitoring  the  effectiveness  of  the  surveillance  and  deciding  when 
additional  effort  will  result  in  diminishing  returns.  When  d\is  occurs,  effort  is  better 
expended  on  the  next  phase  of  the  surveillance  operation. 

The  surveillance  information  requirement  is  specified  as  a  probability  distribution  over 
a  suitably  defined  state  space  and  its  effectiveness  measure  as  the  entropy  of  that 
distribution.  The  strategy  selected  for  a  given  phase  should  aim  to  minimise  the 
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information  uncertainty  and  hence  the  entropy.  When  the  entropy,  or  alternatively  the 
rate  of  improvement  of  entropy,  has  reached  a  specified  threshold  then  the  decision 
may  be  taken  to  proceed  to  the  next  phase  of  the  operation.  When  transitioning 
between  phases  one  assumes  that  the  preceding  information  requirement  has  been 
satisfied.  So  for  example  when  transitioning  from  phase  1  to  phase  II  it  is  assumed  that 
all  targets  have  been  detected,  and  when  transitioning  from  phase  II  to  phase  III  it  is 
assumed  that  all  targets  to  be  tracked  have  been  located.  Furthermore,  we  assume  that 
no  new  targets  enter  the  region  of  interest  once  phase  I  begins. 

This  is,  in  fact,  the  concept  of  a  Bayesian  network  in  which  one  reduces  the  complexity 
of  a  joint  state  space  over  all  possible  targets  and  their  attributes  to  a  set  of 
interdependent  hypotheses  in  order  to  manage  the  combinatoriaUy  large  state  space. 
This  improves  manageability  but  at  the  expense  of  reduced  optimality.  In  practice 
there  will  be  uncertainty  associated  with  these  assumptions  so  they  may  be  treated  as 
hypotheses  and  their  probabilities  monitored.  So  for  instance  in  phase  II,  while  the 
sensors  are  being  utilised  to  accurately  estimate  the  location  of  targets  known  to  exist 
from  phase  I,  they  will  not  be  revisiting  areas  in  which  targets  were  not  found.  Targets 
could  infiltrate  tihose  areas  over  time  so  the  target  motion  model  will  tend  to  reduce 
confidence  in  the  hypothesis  regarding  the  total  number  of  targets  in  the  region. 


9.  Implementation 

Phase  III  of  the  surveillance  operation  was  simulated  in  order  to  assess  the  potential 
improvements  in  tracking  capability  from  implementation  of  the  GAMBIT  algorithm. 
The  problem  to  which  this  technique  is  applied  consists  of  a  region  of  interest  (ROI) 
with  a  single  target  moving  diagonally  across  it;  the  ROI  periodically  falls  within  the 
field  of  regard  of  four  non-overlappmg  swaths  that  notionally  belong  to  one  or 
possibly  more  satellites. 

Phase  I  (search  phase)  has  been  addressed  within  a  different  operational  context  in 
Berry  [5].  The  procedure  for  implementing  tracking  is  presented  in  Appendix  C.l. 

9.1  Target  Motion  Models 

9.1.1  Dead  Reckoning 

If  the  target  speed  v  is  measured  to  within  ±  Av  and  its  heading  0  to  within  ±A0,  we 
assume  it  continues  along  a  straight  line  path  and  estimate  its  position  after  a  time  f,.  to 
be  contained  within  a  segment  of  an  anulus  between  the  circles 

(x  -  X,. )'  +  (y  -  y,.  J  =  [(v  -  Av)/,.] ' ,  and 
(x  -  X,. )'  +  (y  -  y,  Y  =  [(v  +  Av)/,] ' 
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and  bounded  by  the  two  lines 

(y-y,.)=tan(-A^X^-x,.), 

(y-J^,)  =  tan(A^X^-x,). 

9.1.2  Gauss-Markov 

An  explanation  of  the  Gauss-Markov  motion  model  in  greater  detail  can  be  found  in 
Fogg  [6].  Let  n  denote  the  n*  epoch,  and  be  the  number  of  time  steps  {At )  between 
the  n*  epoch  and  the  (n-1)*  epoch.  Assume  that  the  target  is  at  position  at  time 

zero.  Furthermore,  assume,  that  the  x  and  y  components  of  the  position  are 
independent,  and  the  accelerations  in  each  are  white  Gaussian  sequences,  ,  with  the 
same  variance  Accelerations  are  constant  over  a  time  step  At.  Denoting  the 
sequences  of  the  x-  and  y-components  of  the  target's  position  as  {x:,, }  and  } ,  then 
these  are  Gauss-Markov  and  the  equations  of  motion  are  2'’'^  order.  For  {x„}  these 
equations  are  as  follows  (those  for  {y^}  are  similar): 

+  m„,^Atx„  +^{m„^^Atf£„ 

where  is  the  target's  x-component  of  velocity  at  epoch  n. 


The  conditional  probability  of  finding  the  target  at  (jc„,y„)  at  a  time  m^At  after  it  was 
at  (A:„_,,y„_,)  is  denoted  by  p^x„,y„)\  , y„_i )}  and  is  known  for  this  type  of  model 

to  be  bivariate  Gaussian  with  jc„  having  mean 


and  variance 


/“x  =  Vi  +x„_,m„At 


2  Ami  —m„  4 

cr^  = — = - ^^At\ 

12 


y„  has  a  similar  expression  for  the  mean  and  an  identical  variance  [10].  Thus, 
p{(xn . ) I k,-i , y„-i )}  =  exp  ^ {(x„ -pj  +{y„-Py)]  . 


The  value  of  ^  is  a  priori  information,  which  is  determined  by  the  target's  known 
motion  over  time  relative  to  the  chosen  time  increment  At. 

The  Gauss-Markov  Motion  update  procedure  can  be  found  in  Appendix  C.2. 
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9.1.3  Random  Course  Change 

In  this  model  a  ship  is  assumed  to  continue  on  its  estimated  course  at  constant 
estimated  speed  with  at  most  one  change  in  course  at  a  random  chosen  instant  of  time, 
with  the  course  change  angle  chosen  from  a  specified  distribution.  Note  that  this  may 
seem  to  suggest  that  we  begin  with  a  deterministic  start  point  and  velocity  (track)  but 
the  model  is  implemented  for  all  possible  start  locations  and  velocities,  weighted 
according  to  the  probabilities  calculated  from  the  Bayesian  update.  We  assume  that  the 
target  is  at  the  origin  at  observation  time  (epoch)  and  that  the  next  observation  time 
is  .  Without  loss  of  generality  we  align  the  direction  of  motion  with  the 

Cartesian  coordinate  system  as  indicated  in  Figure  1  and  assume  the  target  speed  is  s. 

Each  point  is  uniquely  specified  by  the  pair  of  variables  (x,^)  or  {z,0)  so  there  is  a  1- 
to-1  mapping  between  them.  In  particular  for  any  given  point  in  the  (x,y)  -plane,  there 
is  only  one  way  of  getting  to  it  from  the  origin  in  terms  of  distance  z  before  a  course 
change,  and  angle  0 .  Depending  upon  the  distributions  for  z  and  0 ,  some  points  may 
be  inaccessible.  In  particular  points  distant  greater  than  sT  from  O  are  inaccessible  in 
time  T . 

We  begin  by  writing  the  transformation  between  {x,y)  and  (z,0)  .  This  is  simply 

X  =  (sT  -  z)sin0 
y  =  z  +  (sT-z)cos0 

Next  we  derive  the  p.d.f.  (probability  density  function)  for  the  probability  of  reaching  a 
point  defined  in  terms  of  (z,0)  from  O.  We  treat  z  and  0  as  random  variables  and 
suppose  that  they  are  independent.  Then  the  joint  p.d.f.  for  z  and  0  can  be  written 

Pr  {reaching  element  (z, z  +  dz)  x  (0,0  +  d0)}  -  p(z,0)dzd0  -  (z)/^ (0)dzd0 . 

The  function  Pg(0)  may  be  arbitrary  but  should  be  chosen  so  as  to  reflect  reality  as 
closely  as  possible.  It  could  even  be  spatially  dependent  to  take  account  of  straits 
between  islands  for  example.  The  function  P^  (z)  needs  to  be  chosen  with  some  care, 
however,  because  the  time  of  an  observation  is  statistically  uncorrelated  with  the 
time  t,^+zl s  of  a  course  change.  The  simplest  assumption  is  to  make  the  time 
between  course  changes  negatively  exponentially  distributed.  Thus 

where  X  may  be  interpreted  as  the  average  rate  of  course  changes  per  unit  distance. 

Finally,  the  p.d.f.  has  to  be  converted  into  (x,y)  coordinates.  This  requires  the 
determinant  of  the  Jacobian  of  the  coordinate  transformation  as  follows: 
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Pr  {reaching  element  (x,  x  +  dx)  y~{y,y  +  dy)}  =  p{x,y)dxdy 

^  pix(z,0),y(z,0))\ 
=  p(z,0)dzd0 


Hence 


where 


d{x,y) 


d(z,0) 


Pix,y)  = 


p{z,0)  _PA^)Pe{0) 


d{x,y) 

d{x,y) 

d{z,0) 

d{z,0) 

8ix,y) 

dxidz 

dx/d0 

d(z,0) 

dy/dz 

dyld0 

=  {sT -z)(}-cos0) 


A  y 


\dzd0 
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10.  Performance  and  Comparison 


10.1  General  Variables 

At  this  stage  of  the  development  of  the  Bayesian  tracker,  the  algorithm  is  tested  over  a 
relatively  small  region  of  interest  (ROI)  due  to  its  memory  and  computational 
requirements.  If  the  ROI  has  NxN  cells,  then  the  total  number  of  cells  that  must  be 
stored  is  NxNxN^  as  where  the  target  is  expected  to  be  for  the  next  observation  is 
conditioned  on  where  it  was  after  the  last  observation. 

Ideally  we  want  the  ROI  to  span  the  entire  area  where  a  target  could  exist  and  also 
have  a  high  resolution  (i.e.,  large  number  of  cells  per  given  area  target  size),  but  a  ROI 
satisfying  both  of  these  requirements  would  be  impossible  to  store  on  computer.  So  we 
choose  a  ROI  size  of  16x16  enabling  us  to  obtain  Monte  Carlo  statistics  on  the 
performance  of  various  tracking  methods  in  a  reasonable  amount  of  time. 

To  reduce  the  effect  of  "motion-leakage"  or  "probability-leakage"  across  the 
boundaries  of  the  ROI,  two  layers  of  "buffer"  cells  are  placed  around  the  entire 
perimeter  of  the  ROI,  allowing  these  probabilities  to  be  accotmted  for,  to  a  certain 
extent,  beyond  the  area  covered  by  the  ROI.  The  probabilities  within  these  buffer  cells 
allow  us  to  compute  the  probability  flow  of  a  target  back  into  the  ROI.  For  any 
"leakage"  beyond  these  buffer  cells,  however,  a  single  variable  is  used  to  retain  the 
total  probability  that  the  target  is  neither  in  the  ROI  nor  the  buffer  cells.  This  enables 
the  Bayesian  update  to  be  computed  correctly  since  all  probabilities  are  accounted  for. 
Thus,  the  overall  selection  of  the  size  of  the  ROI  is  a  compromise  on  having  enough 
cells  of  sufficient  resolution  to  cover  a  desired  target  area  and  also  having  enough 
buffer  cells  to  minimize  the  effect  of  probability  overflow,  whilst  minimizing  the 
memory  and  computational  demands  made  on  the  computer  by  having  too  many  cells. 


10.2  Target  Motion 

In  order  to  test  the  basic  utility  of  the  expected  entropy  algorithm,  a  relatively  simple 
target  motion  trajectory  has  been  chosen,  and  is  shown  in  Figure  2  below.  The  target 
begins  at  an  (x,y)  position  of  (l,l),  moves  diagonally  down  until  halfway,  then  makes 
a  course  change,  and  eventually  moves  diagonally  down  to  the  bottom  left-hand  comer 
cell.  A  second  motion  trajectory  is  shown  in  Figure  3,  where  the  target  continues 
beyond  the  halfway  point,  but  now  changes  direction  at  cell  (l0,10).  This  trajectory 
ensures  that  the  target  does  not  change  direction  at  the  boundary  of  two  swaths,  which 
are  defined  later. 

We  assume  that  the  sensor  swaths  access  the  ROI  once  per  time  unit  or  epoch;  the 
target  speed  is  then  chosen  such  that  the  target  moves  to  a  different  cell  for  each  epoch. 
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The  motion  is  always  linear  except  for  one  course  change.  This  motion  not  only  tests 
the  ability  of  the  various  swath  selection  techniques  to  track  the  target  as  it  moves  in  a 
linear  fashion,  but  also  whether  they  can  retain  or  reacquire  access  to  the  target  once  it 
makes  a  drastic  change  in  heading. 


X 


Figure  2:Target  motion  trajectoiy.  The  target  moves  from  tire  upper  left-lmnd  comer  to  the 

lower  left-hand  comer. 
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Figure  3:  Second  target  motion  trajectory.  Tire  target  moires  from  the  upper  left-hand  corner 

towards  the  lower  left-hand  comer. 


10.3  Sensor  Swaths 

Throughout  these  experiments  we  use  only  4  identical  swaths  as  illustrated  in  Figure  4. 
These  are  non-overlapping  but  together  cover  the  entire  ROl.  Each  swath  has  a  width 
of  4  cells  and  a  length  of  16  cells.  We  assume  that  the  sensors  are  also  identical,  namely 
the  probabilities  of  detection  and  false  alarm. 
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Swath  1  Swath  2 


Figure  4:  The  four  sxoaths  (displayed  as  black  strips)  used  in  tlxe  experiment.  Each  is  4  cells  xoide 
and  16  cells  long,  xoith  identical  probabilities  of  detection  and  false  alarm. 


10.4  Alternative  Swath  Selection  Techniques 

This  paper  describes  and  derives  a  method  of  optimally  selecting  a  swath  for  target 
tracking  by  choosing  the  swath  for  each  access  that  minimizes  the  expected  entropy 
across  the  ROI.  Here,  we  discuss  three  other  techniques  that  can  be  used  to  select  the 
swath.  They  are  the  random,  maximum  probability,  and  maximum  sum  of  probability 
methods. 

10.4.1  Random  Selection 

This  is  the  simplest  swath  selection  technique.  At  each  epoch,  we  randomly  select  one 
of  the  four  swaths  displayed  in  Figure  4.  Thus,  each  swath  has  a  Vi  chance  of  being 
selected  during  any  access  period  or  epoch. 

The  results  from  the  random  swath  selection  technique  can  be  considered  as  the 
baseline  performance,  as  it  is  the  simplest  possible  selection  technique  if  we  had  no 
information  on  where  the  target  is  and  where  it  is  heading.  If  we  did  have  this 
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information  then  it  is  a  simple  matter  to  choose  the  appropriate  swath,  thereby 
resulting  in  excellent  performance. 

10.4.2  Maximum  Probability 

In  this  technique  the  swath  that  covers  the  cell  with  the  largest  probability  is  chosen. 
This  is  intuitive,  and  mathematically  can  be  expressed  as: 

Select  swath  5',.  such  that:  j]qax  (Py  '  1  l3:rgest  for  i  =  1,2, 3, 4 . 

10.4.3  Maximum  Sum  of  Probability 

This  technique  is  similar  to  the  maximum  probability  one,  except  that  we  choose  the 
swath  with  the  greatest  sum  of  probabilities  from  all  the  cells  lying  imder  the  swath. 
Thus, 

Select  swath  such  that:  ^  is  largest  for  i  - 1,2,3,4 . 

The  problem  with  the  three  techniques  described  above  is  that  they  do  not  take  into 
account  the  non-idealities  of  the  sensor  i.e.,  they  are  independent  of  the  sensor's 
probabilities  of  detection  and  false  alarm.  The  Bayesian  technique,  however,  can 
incorporate  such  sensor  information. 

10.5  Results 

To  run  the  simulation  requires  a  number  of  variables  to  be  set.  These  are  presented  in 
Table  1.  The  variable  Runs  refers  to  the  number  of  Monte  Carlo  simulations  performed 
to  obtain  the  performance  statistics.  N  is  the  width  (and  breadth)  of  the  ROI,  and  Buffer 
is  the  number  of  layers  of  cells  added  to  the  perimeter  of  the  ROI  to  reduce  the  effects 
of  probability  overflow,  and  are  the  target's  initial  speed  values  in  the  x  and  y 

directions.  We  do  not  assume  that  these  values  are  known  so  they  are  set  to  0.  is 
the  greatest  possible  speed  that  the  target  could  achieve,  and  is  used  mostly  to  reduce 
the  computational  complexity  of  the  problem.  At  is  a  fixed  time  interval  over  which 
the  target's  acceleration  is  constant,  and  n  is  the  number  of  such  time  interval  between 
successive  accesses  or  epochs.  Throughout  all  these  experiments,  nAtwas  kept  at  a 
constant  value.  The  value  of  ^  determines  the  width  of  the  Gaussian  in  the  target 
motion  model  and  is  varied  in  the  examples  provided,  as  are  the  sensor  characteristics 
of  and  pj-^ . 

Table  1:  Variables  and  tire  corresponding  values  used  in  tire  simulations. 


Variable 

Value 

Runs 

100 
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N 

16 

Buffer 

2 

0 

4 

At 

1/60 

n 

Pd^Pfa 

See  Figures 

To  measure  the  performance  of  the  various  techniques  we  compute  the  total 
normalized  entropy  across  all  cells  of  the  ROI.  i.e., 

JeROI 

where  K  is  a  normalizing  factor  corresponding  to  the  maximum  possible  entropy, 
which  occurs  when  the  probability  distribution  is  uniformly  distributed  across  all  cells. 
For  NxN  cells: 


K^-'Zpj\ogpj=-f^^\og 


y=i 


(  1  ] 

1  9  1  1 

=  -7V'-Vlog 

W) 

1  ' 

=  log7V^ 


Better  performance  is  indicated  by  a  smaller  normalized  entropy  value  over  the  ROI. 
From  here  on  all  entropy  values  quoted  are  the  normalized  values. 


Figure  5  shows  a  comparison  of  the  four  possible  swath  selection  techniques  for  the 
target  motion  as  illustrated  in  Figure  2  for  values  of  ^=12.5,  =0.95  and 

=0.001.  The  other  parameters  used  in  the  simulation  are  given  in  Table  1.  The 
entropy  values  are  averaged  over  the  total  number  of  rvms  {“Runs"). 


Overall  the  random  selection  technique  i.e.,  the  baseline  method,  appears  to  be  the 
worst,  while  the  other  three  selection  methods  appear  to  give  essentially  the  same,  but 
much  better,  performance. 


For  the  first  few  epochs  the  expected  entropy  and  maximum  sum  of  probability 
methods  are  worse  than  the  other  two,  as  they  are  affected  by  the  probability  of 
leakage  out  from  the  ROI  and  into  the  buffer  cells.  This  leakage  does  not  allow  them  to 
choose  swath  1  (which  would  cover  the  target  for  the  first  few  epochs).  The  random 
and  maximum  probability  methods  (for  these  epochs)  are  both  unaffected  by  leakage, 
hence  they  can  select  swath  1,  resulting  in  a  detection  event  and  smaller  overall 
entropy. 
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Notice  that  there  are  two  "spikes"  at  epochs  5  and  9,  and  to  a  lesser  extent  at  epoch  13, 
due  to  the  target  probability  distribution  crossing  over  from  one  swath  to  another, 
hence  the  spreading  of  this  distribution  over  two  adjacent  swaths.  Selecting  either 
swath  does  not  cover  the  entire  probability  distribution;  thus  the  entropy  momentarily 
goes  up,  but  all  three  eventually  lock  onto  the  target  and  reduce  the  entropy  to  almost 
zero. 


Epoch 


Figure  5:  Comparison  of  the  swath  selection  techniques  for  the  target  motion  path  shown  in 
Figure  2,  for  ^  =  12.5  and  =  0.95 ,  Pf^  =  0.001 . 


Figure  6  shows  a  comparison  between  the  four  techniques  for  the  motion  shown  in 
Figure  3  with  parameter  values  of  =  30 ,  p^  —  0.95  and  p^^  —  0.001 .  Once  again  the 

random  or  baseline  technique  is  overall  the  worst,  whereas  the  other  three  techniques 
are  generally  the  same  and  reach  almost  zero  entropy  after  6  epochs.  The  random  and 
maximum  probability  selection  techniques  are  better  for  small  epochs,  for  the  reasons 
discussed  above.  There  is  also  a  spike  in  the  entropy  at  epoch  5  when  the  target 
distribution  is  distributed  over  swaths  1  and  2.  The  three  techniques  are  able  to  lock 
onto  the  target  and  reduce  the  entropy  to  almost  zero.  There  are  also  small  spikes  at 
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epoch  10  when  the  target  changes  direction,  and  epoch  13  when  the  probability  is 
spread  over  two  swaths.  Nevertheless,  the  increase  in  entropy  is  very  small. 


Figure  6:  Comparison  of  the  swath  selection  techniques  for  the  target  motion  path  shoion  in 
Figure  3  for  ^  =  30  and  =  0.95 ,  p fa  =  0.001 . 


Finally,  Figure  7  is  for  a  similar  scenario  to  that  of  Figure  6,  but  with  a  more  non-ideal 
sensor:  pj  =  0.80  and  pj^  =  0.01 .  Again  the  random  swath  selection,  or  baseline, 

technique  is  the  worst,  while  the  other  three  are  very  similar.  In  this  case  there  is  a 
broad  spike  visible  at  epoch  13  when  the  target's  probability  is  distributed  over  swaths 
2  and  3. 

The  very  non-ideal  nature  of  the  sensor  and  the  broad  Gaussian  motion  model 
(^  =  30),  results  in  the  techniques  taking  many  epochs  to  reduce  the  entropy  and  even 
then  the  minimum  is  not  close  to  zero. 
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Epoch 

Figure  7:  Comparison  of  tite  swath  selection  techniques  for  the  target  motion  path  shoxon  in 
Figures,  for  ^  =  30and  =0.80,  pj^  =0.01. 

The  results  above  indicate  to  us  a  number  of  points:  firstly,  the  expected  entropy 
algorithm  developed  in  previous  sections  of  this  report  can  track  a  target,  albeit  in  a 
simplified  scenario  for  the  time  being.  Secondly,  selecting  the  optimal  swath  based  on 
the  expected  entropy  gives  results  as  good  as  a  number  of  other  intuitive  techniques, 
and  much  better  than  the  baseline  performance  of  randomly  selecting  a  swath. 
Although  more  complicated  than  the  other  two,  it  is  the  only  technique  that  can  take 
into  account  the  non-ideal  nature  of  the  sensors. 

11.  Demonstration  of  Benefits  for  Analytically 
Tractable  Example 

The  inconclusiveness  of  the  previous  results  has  prompted  some  analysis  of  an 
analytically  tractable  example.  The  fimdamental  question  to  be  answered  is  whether, 
for  a  given  a  priori  probability  distribution,  the  use  of  expected  entropy  as  a  measure  of 
surveillance  performance  for  optimising  resource  allocation  for  space-based 
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surveillance  assets  causes  an  extremal  choice  of  sensor  resolution  (such  as  maximum 
area  /  minimum  resolution  or  minimum  area  /  maximum  resolution)  that  could  be 
arrived  at  by  the  application  of  simple  rules,  or  an  intermediate  choice,  which  requires 
computation  to  determine. 

Fundamentally,  the  balance  between  the  second  and  third  terms  in  the  r.h.s  represents 
the  trade-off  between  sensor  area  coverage  and  sensor  resolution  of  the  expected 
entropy  expression: 

*’  =  *  -  Z  A  I ')  'osi*'  I  '■)  +  log  P(l) 

i  I  / 

Note  that  the  second  term  is  positive  and  has  a  tendency  to  increase  entropy  whereas 
the  third  term  is  negative  and  has  a  tendency  to  reduce  it.  The  simplest  way  of 
exploring  this  issue  is  in  the  localised  formulation  because  each  cell  has  only  two  states 
(making  the  likelihood  function  and  entropy  computation  easy)  and  the  entropy  for  the 
joint  distribution  over  all  cells  is  simply  the  sum  of  the  entropies  for  the  individual 
cells.  This  can  be  explored  without  using  simulation  by  assuming  a  prior  distribution 
and  computing  the  expected  entropy  for  a  choice  of  sensor  resolutions  (and 
corresponding  coverages).  Note  that  in  the  localised  formulation,  cells  are  independent 
so  the  result  of  inspecting  cells  does  not  affect  cells  not  inspected.  In  other  words,  the 
total  change  in  expected  entropy  only  involves  a  summation  over  those  cells  that  are 
inspected. 

In  the  remainder  of  this  section  we  illustrate  the  potential  of  the  developed  algorithm 
to  select  an  "optimal"  swath  mode,  or  position,  from  a  suite  of  on-board  sensors.  One 
would  hope  that  the  swath  chosen  by  the  algorithm,  which  is  the  swath  that  minimises 
the  entropy  over  the  ROI,  results  in  a  natural  trade-off  between  resolution  (cell  area) 
and  overall  swath  size.  Ideally,  the  resolution  should  be  high  as  possible  i.e.,  small  cell 
area,  thereby  increasing  the  probability  of  detection  and  decreasing  the  false  alarm  rate 
(FAR).  Furthermore,  a  large  swath  size  over  the  ROI  is  desirable,  as  this  increases  the 
likelihood  that  the  target  lays  within  the  swath  during  a  satellite  overpass. 

Generally  these  two  desirable  features  cannot  be  achieved  simultaneously  with  most 
sensors  i.e.,  selecting  a  sensor  with  high  resolution,  or  small  cell  area,  results  in  a 
smaller  swath  area  than  a  sensor  with  coarser  resolution.  From  the  specifications  in 
"RADARSAT  Illuminated"  this  appears  to  be  a  reasonable  assumption.  Thus,  there  is  a 
trade-off  between  the  two,  which  we  expect  the  algorithm  to  determine  in  some  way. 

To  illustrate  this  consider  the  following  simple  example  of  choosing  between  four  of 
RADARSAT's  modes:  HI,  SI,  W1  and  LI.  The  Table  below  gives  their  cell  area,  swath 
width  and  area,  and  the  approximate  total  number  of  cells  within  the  swath. 
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Table  2:  Details  of  tlie  four  sensor  modes  used  (see  "RADARSAT  Illustrated"). 


Mode 

Cell  Area  (m^) 

Swath  Width  (km) 

Swath  Area  (m^) 

Total  Cells 

HI 

535 

75 

5.63x10® 

1.052x10® 

SI 

702 

100 

1.00x10'° 

1.425x10® 

W1 

959 

165 

2.72x10'° 

2.840x10® 

LI 

980 

170 

2.89x10'° 

2.949x10® 

The  following  two  figures  show  the  swath  width  (SW)  and  total  number  cells  (N) 
plotted  against  the  cell  area  (A)  as  circular  points.  Two  quadratic  curves  are  fitted  to 
the  data,  resulting  in  the  following  relationship: 

SW  =  0.00023 \5A^-0. 135443 U  +  8 1 . 1 669018  km, 

N  =  73.61 9^1^  -  68344.966,4  +  25998324.264 . 


Figure  8:  Data  and  fitted  curve  of'Sioath  Width'  versus  'Cell  Area'. 
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To  determine  how  each  sensor  mode  or  swath  reduces  the  total  entropy,  one  needs  to 
compute  the  chemge  in  entropy  per  cell,  once  the  probabilities  of  detection  and  false 
alarm  are  estimated.  We  take  a  simple  approach,  which  allows  us  to  illustrate  the 
principle  without  delving  unnecessarily  into  the  complex  modeling  aspects  of  sensors 
and  the  radar  processing  of  the  returns. 


We  assume  that  the  probability  of  detection  is  a  function  of  the  swath  cell  area  covered 
by  the  target.  The  probability  of  detection  is  computed  from  a  sigmoid  function  with 
the  free  variable  being  the  fraction  of  the  cell  area  covered  by  the  target: 


u  = 


2X 


-1, 


pA^)= 


I +  6" 


where  Ay  is  the  target's  area,  A  is  the  swath's  cell  area,  and  a  is  the  slope  of  the 
function  at  w  =  0 . 


For  this  experiment  a  value  of  o:  =  5  was  used.  The  FAR  is  based  upon  the  cell  area 
multiplied  by  the  density  function  of  the  probability  of  false  alarm.  If  a  cell  has  area  Aq 
and  FAR  of  ,  then  the  FAR  of  a  cell  with  area  A  is  given  by 

Pfa  ~  j  PfaO- 
A) 
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In  our  case,  a  nominal  FAR  of  1 was  assumed  for  a  cell  area  of  500  m^,  and  so  the 
FAR  for  the  four  modes  were  computed  from  their  relative  cell  area  values.  The 
probabilities  of  detection  and  false  alarm  for  the  four  sensor  modes  are  given  below  for 
a  target  area  of  600  m^. 


Table  3;  Computed  values  of  probabilities  of  detection  and  false  alarm. 


Mode 

Pe 

Pfa 

HI 

.9980 

1.068x10'® 

SI 

.9720 

1.404x10'® 

Wl 

.7796 

1.916x10'® 

LI 

.7544 

1.960x10'® 

Once  pj  and  are  estimated,  the  change  in  entropy  per  cell  can  be  computed  using 

equation  6.1  from  Section  6.4,  assuming  that  the  probability  of  a  target  existing  in  a  cell 
is  /?  =  0.70 .  The  total  entropy  change  in  any  given  swath  is  the  total  entropy  change 
per  cell  multiplied  by  the  number  of  cells  in  that  swath,  and  its  magnitude  is  denoted 
by  7V|A/z|. 

The  figure  below  shows  a  3-D  line  plot  of  how  the  swath  width  and  total  entropy  vary 
as  the  cell  area  varies.  The  four  circles  represent  the  actual  data.  The  solid  line  is  the 
total  entropy  change,  while  the  dotted  line  is  the  projection  of  this  onto  the  'Swath 
Width'  and  'Area'  plane.  The  solid  line  is  the  plot  of  the  values  obtained  from  the  fitted 
curves  described  above. 

Although  there  are  only  four  points,  for  the  particular  parameters  and  approximations 
chosen,  there  is  a  peak  in  the  total  entropy  change  curve.  The  entropy  method  teUs  us 
that  the  "optimal"  ceU  area  is  somewhere  between  850m2  and  900  m^.  Of  course,  the 
four  RADARSAT  modes  do  not  have  a  cell  area  exactly  equal  to  this,  so  the  mode  with 
the  closest  cell  area  is  chosen,  in  this  case  mode  Wl. 
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Figure  10:  Plot  of 'Total  Entropy  Change'  vs.  'Swath  Width'  and  'Cell  Area'.  Tire  dotted  line  a 
projection  of  the  solid  line  onto  the  'Sxvath  Width'  and  'Cell  Area'  place. 


12.  Conclusions 


This  report  provides  optimal  solutions  to  the  problem  of  dynamically  determining  the 
allocation  and  control  of  space-based  surveillance  resources  for  the  purpose  of 
detecting,  locating  and  tracking  surface  maritime  targets  using  the  GAMBIT  formalism 
[Berry  &  Fogg  (1)].  The  sensors'  performances  were  modelled  in  terms  of  their 
probabilities  of  detection  and  false  alarm.  Three  types  of  target  motion  model  were 
considered,  deterministic,  Gauss-Markov  and  conservative.  Multiple  targets  were 
allowed.  The  solution  progressed  by  cyclical  target  motion  prediction  and  Bayesian 
update.  Significant  computational  efficiency  was  achieved  by  use  of  a  'localised 
formulation'  of  the  problem. 
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The  approach  presented  provides  a  rational  basis  for  space-based  surveillance  asset 
optimisation  which  sets  information  as  the  objective,  incorporates  platform  dynamics 
and  constraints,  sensor  capabilities,  stochastic  effects  and  control  variables  within  a 
single  formulation. 

As  well  as  being  an  issue  of  interest  in  its  own  right,  this  is  an  instance  of  the  more 
general  surveillance  asset  allocation  problem. 

To  illustrate  the  implementation  of  this  process,  the  use  of  SAR  satellites  with  several 
modes  allowing  different  swaths  and  resolutions  was  considered.  Optimal  selection  of 
swaths  for  target  tracking,  by  choosing  the  swath  for  each  access  that  minimizes  the 
expected  entropy  (MOE)  across  the  region  of  interest,  was  demonstrated. 

Three  other  MOE  were  used  to  select  the  swath,  random,  maximum  probability,  and 
maximum  sum  of  probability.  All  were  shown  to  be  better  than  fixed  swath  or  random 
swath  selection. 

It  might  be  expected  that  with  less  ideal  targets  which  change  speed  or  direction  more 
frequently  or  dramatically,  a  Monte  Carlo  simulation  involving  multiple  runs  would  be 
more  discerning  statistically. 

It  was  shown  that  situations  arise  in  which  an  intermediate  rather  than  extreme  choice 
of  sensor  resolution  do  result  in  an  optimal  solution  of  the  surveillance  asset  resource 
allocation  problem,  thereby  providing  justification  for  the  proposed  approach. 


13.  Recommendations 


Further  work  is  required  to  fully  validate  the  concepts  reported  here,  both  in  terms  of 
confirming  and  quantifying  the  benefits  to  space-based  surveillance  effectiveness  and 
in  demonstrating  the  practicality  of  implementation.  In  order  to  achieve  this,  a  detailed 
simulation  model  is  required  which  captures  satellite  dynamics,  ideally  based  upon  a 
high  fidelity  model  (such  as  Satellite  Tool  Kit),  and  a  high  fidelity  SAR  sensor  model 
(such  as  SBST  [9])  is  also  needed.  For  more  general  and  complex  surveillance  asset 
allocation  problems,  which  potentially  involve  constrained  optimisation  of  a  mix  of 
discrete  and  continuous  parameters  over  time,  commercial  optimisation  software  (such 
as  ILOG)  should  be  investigated. 
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15.  Mathematical  Notation 

Set  of  cells  within  5?  that  are  accessed  by  a  swath  (at  epoch  n). 

Area  covered  by  each  cell  in  5H  . 

Likelihood  (•)  summed  over  all  possible  detection  events. 

Likelihood  Lj,  (•)  summed  over  all  possible  detection  events. 

Likelihood  Lju  (•)  summed  over  all  possible  detection  events. 
Combinations  of  taking  k  from  M. 

Statistical  expectation  value. 

Expected  entropy. 

Measured  entropy. 

Entropy  of  cell  i  at  the  n^'’  epoch. 

Entropy  of  cell  i  at  the  n*'<  epoch  after  a  non-detection  event. 

Entropy  of  cell  i  at  the  nf'  epoch  after  a  detection  event. 

Entropy  of  cell  i  at  the  epoch  (immediately  before  (n  -I- 1)*  epoch). 
Number  of  detection  events. 

Set  of  observations  (detections)  {y, ,  v»  7* }  epoch  n. 

Likelihood  fimction. 

Likelihood  of  obtaining  measurement  set  /  given  the  target  is  in  cell  i. 
Likelihood  function  for  a  cell  accessed  by  swath  but  with  no  detection 
event. 

Likelihood  function  for  a  cell  accessed  by  swath  and  with  a  detection 
event. 

Likelihood  function  for  a  ceU  not  accessed  by  swath. 

Number  of  cells  accessed  by  swath  A. 

Number  of  fixed  time  intervals  At  between  successive  accesses. 

Number  of  fixed  time  intervals  At  between  epochs  n  and  (n-1). 

Number  of  cells  in  the  region  of  interest,  91 . 

Epoch  number. 

Probability  density  function. 

Sum  of  probabilities  of  aU  cells  under  the  swath. 

Probability  of  detection. 

Probability  of  false  alarm. 

Probability  of  a  target  being  in  cell  i  at  time  t. 

A  posteriori  probability  distribution  of  target  in  cell  i  at  epoch  n. 

A  priori  probability  distribution  of  target  in  cell  i  at  epoch  ti  (time  ). 
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Probability  that  target  moved  from  cell  i  at  epoch  («  - 1)  to  cell  j  at 

A  priori  probability  distribution  of  target  moving  from  cell  i  at  epoch 
(n  - 1)  to  cell;  at  epoch  n. 

False  alarm  rate  per  unit  area. 

Probability  of  obtaining  th  measurement  set  / . 

Generic  probability  distribution, 
p.d.f.  of  random  variable  z  . 
p.d.f.  of  random  variable  6 . 

-  Probability  that  target  is  in  cell  j  at  time  given  that  it  was  in  cell  i 
at  time  . 

Region  of  interest. 

Maximum  target  velocity. 

Time  from  epoch  0. 

Time  of  satellite  access  to  the  region  of  interest 
Fixed  time  interval. 

Time  between  epochs  and  . 

Cartesian  coordinates. 

Coordinates  after  m  time  steps  At .  Equivalent  to  x{rnAt),y{mAt) . 

RV  for  the  distance  travelled  by  target  before  course  change. 

Sensor  look  angle. 

Average  rate  of  course  changes  per  unit  distance. 

Sequence  of  target's  acceleration. 

Variance  of  target's  acceleration. 

Target's  expected  position  in  Cartesian  coordinates. 

Standard  deviation. 

Set  of  all  possible  observations,  / . 

Random  variable  for  the  target's  course  change. 

}  -  Set  of  k  detection  events. 
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Appendix  A:  Derivation  of  a  general  expression  for 
the  expected  entropy  over  a  set  of  observations 

Assuming  that  there  is  a  single  target  in  91  and  that  observation  in  a  swath  A  results  in 
a  set  of  measurements  i  =  th®^  Bayes'  rule  is  used  to  update  the 

probability  distribution  for  the  target  location  thus 

Pib 

where  P(0  =  ^  I  Op,  is  the  probability  of  the  set  of  measurements 

L  -  occurring.  Let  the  set  of  all  possible  measurements  /  be  A  then  the 

entropy  before  a  set  of  measurements  /  e  A  is 

^  =  -5]  A  log/?,. 

/eiR 

and  after  wards  is 

h'{D  =  -J^p'iogp'. 

/e'Jl 

rl  PiD  J  I  PiD  J 

=  I  OiogL(/ 1 I  Op,  iog/?,  ]+iogP(D 

le'J!  is'Ji 

Hence  the  expected  entropy  over  all  possible  sets  of  measurements  /  e  A  is 

/gA 

= ^  -Z  I  oiog^(/ 1 0 +Z^(^)iog^(D 


after  simplification. 
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Appendix  B:  Interpretation  of  the  expression  for  the 

expected  entropy 


With 

h'  =  h  I  i)\ogL{l  1 0  +  Y.P^l)\ogP{l) 

i  i  i 

the  second  term  on  the  right  hand  side  represents  the  increase  in  entropy  arising  from 
the  imperfections  in  the  sensors.  This  is  because  if  the  likelihood  functions  are  all  I's  or 
O's  then  the  term  vanishes.  This  would  correspond  to  the  case  of  perfect  sensors.  So  this 
term  and  its  sign  are  explainable. 

As  for  the  last  term  on  the  r.h.s.,  this  obviously  represents  a  decrease  in  entropy  because 
of  its  sign.  We  can  understand  this  by  considering  the  case  of  a  perfect  sensor  that  can 
see  everywhere  =  1  and  =0).  Then  the  observation  label  I  corresponds  to  a 

cell  i  and  we  see  that  ^  P(l) log P(l)  =  ^  p.  log p^  =  -h  and  then  h'  =  0.  This  says  that 

/  I 

the  sensor  will  'see'  a  target  in  a  cell  in  proportion  to  the  probability  of  it  being  there 
and  the  entropy  immediately  drops  to  zero  (perfect  information).  Thus,  the  sign  seems 
to  make  sense. 

Now  consider  the  case  of  a  perfect  sensor,  which  cannot  see  everywhere,  but  just  sees  a 
subset  of  all  the  cells  (a  swath).  We  want  to  choose  to  look  in  those  cells  that  make  the 
biggest  contribution  to  the  reduction  m  h.  We  can  only  sum  over  the  states  in  the 

observation  space  in  ^jP(/)logP(/)  i.e.,  over  the  cells  in  which  we  look.  This  is  now 

/ 

closer  to  the  approach  of  summing  probabilities  over  the  cells  and  choosing  the  swath 
with  the  biggest  sum,  or  choosing  the  swath  that  contains  the  cell  with  the  biggest 
probability.  Thus,  we  choose  the  swath  S  that  makes  ^  p^  log  the  most  negative 

ieS 

and  hence  reduces  the  expected  entropy  the  most. 

An  interesting  question  is  whether  maximising  the  change  in  entropy  by  choosing  the 
cells  that  provide  the  biggest  plogp  contributions  is  equivalent  to  maximising  p  or 

Note  that  p\ogp  is  biggest  when  p  is  closest  to  0.5,  however  there  is  a 
constraint  that  all  the  p's  must  sum  to  1  so  it  is  not  obvious  in  general. 

Unlike  the  approach  of  maximising  probabilities  to  find  the  optimum  swath,  this 
entropy  approach  automatically  takes  account  of  the  imperfections  in  the  sensors 
because  of  the  second  term  on  the  r.h.s.  (eg  multipath  effects,  cloud  cover  for  EO 
sensors)  and  hence  is  more  general. 
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Appendix  C:  Implementation  Procedures 

C.l.  Discretized  Tracking  Procedure 

The  following  procedure  explains  the  discretized  tracking  implementation,  which  is 
presented  in  Section  9. 


1.  Initialise  search  and  tracking: 

•  Epoch:  n  =  0  ^ 

•  Cell  probabilities:  1,2,.., 

2.  Compute  target  motion  transition  probabilities: 

•  If  n  =  0 :  compute  Q{j  /  i,  m),  m  =  i  using  algorithm  in  Appendbc  C.2. 

•  Else:  compute  Q{j  /  i,  m),  using  algorithm  in  Appendix  C.2. 

3.  Target  motion  update: 


4.  Select  optimal  swath  by  minimising  the  expected  entropy.  See  Appendix  C.3. 

5.  Update  target's  position. 

6.  Generate  detection  events. 

7.  Bayesian  update.  See  Appendix  C.5. 

8.  Compute  overall  target  probability: 

•  pT'^Zp'j 

i 

9.  Last  epoch  of  simulation? 

•  If  yes,  then  END. 

•  If  no,  then  n  =  n  +  l  and  go  to  2. 
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C.2.  Gauss-Markov  Motion  Update  Procedure 

The  following  procedure  details  the  implementation  of  the  Gauss-Markov  motion 
model  as  described  in  Section  9.1.2.  It  is  called  by  the  main  tracking  procedure  in 
Appendix  C.l,  and  then  returns  to  it  once  it  has  completed  computing  the  probabilities. 


1.  Estimate  Velocity. 

•  Estimate  target's  velocity  v",  ,  i.e.,  target  moved  from  cell  m  at  epoch 
(n  - 1)  to  cell  i  at  epoch  n. 

2.  Predict  Target's  Next  Position. 

•  Predict  target's  position  at  epoch  (n  + 1)  assuming  velocity  v  =  v", .  Let 

the  cell's  x  and  y  coordinates  be  ). 

3.  Compute  Variance. 

•  Compute  the  variance,  ,  of  the  target's  acceleration. 

4.  Compute  Probabilities. 

•  Let  the  p’’  cell's  x  and  y  indices  be  (x^. ,  )•  Compute  the  integral: 


yj+V.iXj+l 


+0.5  JCy+0.5 

T\ 

f  f  exp 

'■} 

J  J 

-0.5  -0.5 

[20-^  ^ 

dxdy 


5.  Return  to  Main  Algorithm,  (i.e..  Return  to  2.  in  main  algorithm  in  Appendix  C.l). 
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C.3.  Optimal  Swath  Selection  Procedure 

The  following  procedure  presents  the  logic  in  computing  the  optimal  swath  that 
reduces  the  expected  entropy;  see  Section  6.3.  This  procedure  is  called  by  the  tracking 
procedure  of  Appendix  C.l,  and  returns  to  it  once  completed.  It  calls  the  procedure  to 
compute  the  expected  entropy  change,  as  described  in  Appendix  C.4. 
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C.4.  Expected  Entropy  Change  Procedure 

The  following  procedure  details  the  process  of  computing  the  expected  entropy  change 
as  described  in  Section  6.4.  It  is  called  by  the  optimal  swath  selection  procedure  in 
Appendix  C.3,  and  also  returns  to  it  once  the  expected  entropy  change  is  calculated. 


1.  Assign  Variables. 

•  M  -  no.  of  cells  in  region  (A)  covered  by  swath 

•  pj  -  Swath's  probability  of  detection 

•  -  Swath's  false  alarm  rate 

•  Pa=YjPc 

ceA 


•  c, 


2.  Compute  Probabilities 

•  Ci^PaY.  0  “  Pd  )Pfi  “  Pd  )Pfa  -  Pfa  } 

k=0 

hi  =Pa'E  ""^t-lPdPfa'  i^-Pfo  Y~'  ^O^PdPT  ^-Pfa  Y'"  } 

k=\ 

=  ^-PA)Yj^CkP%^-P  fa  l0g{p/«  ^-PfaY'''] 

Pl>=fy-Pd%-PfaY~'  PA+^-PfaY^-PA) 

Pm  ^PdPTPA  +pI^-Pa) 

Pk=  P'f^'^-PfaY'''"  ^-Pd)Pfa[pA-YjPj\-^PdP^-Pfa)LPh  -  P  a)p  fa^  -  P  fa) 


•  c 


M-\ 


-  p=/>„iog/>„+/>„iog/>„+2  i;/;iogii 

A=1  UJi-.A} 


3.  Compute  Entropy  Change 

•  A//  =  -c,  -  c„  -  c,„  +  P 

4.  Return  to  Optimal  Swath  Selection  Procedure  in  Appendix  C.3. 
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C.5.  Bayesian  Update  Procedure 

This  procedure  explains  the  Bayesian  update  process  as  described  in  Sections  6.4  and 
7.1.  It  is  called  by  the  main  tracking  algorithm  of  Appendix  C.l,  and  also  returns  to  it 
once  completed. 
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